Add force unmount after timeout option #1710
Open
+48
−25
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Is this a bug fix or adding new feature?
Adding new feature.
What is this PR about? / Why do we need it?
The
force-unmount-after-timeout
feature addresses issues whenNodeUnpublishVolume
gets called infinite times and hangs indefinitely due to broken NFS connections. Kubelet will keep trying RPCNodeUnpublishVolume
and ultimately cause OOMKilled for the pod.When enabled, if a normal unmount operation exceeds the configured timeout, the driver will forcibly unmount the volume to prevent indefinite hanging and allow the operation to complete. By default, the timeout limit is 30 seconds while user can also specify the timeout duration.
What testing is done?